-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add basic AVRO files (translated copies of the parquet testing files to avro) #62
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@pitrou / @kiszk do you have any concerns or suggestions for adding several smaller AVRO files into the testing repository? They are used for apache/datafusion#910 and we may consider adding avro support to the main apache-rs repo as well.
This seems fine to me, but can you add a README explaining what these files are and how they were obtained? |
Looks good to me |
It would be possible to use arrow-python and fastavro to achieve the same, I just have a lot of Spark experience and I prefer typed so I went that way. |
Thanks @Igosuki ! I am sorry for the delayed response -- I am catching up from being on vacation and hope to help push your contributions over the line real soon now |
All good 👍 |
N.B. : I used spark for the translation so there is some additional metadata in the files, but they can be removed.